Model Selection

Fine-grained alignment

# Fine-grained alignment

FG-CLIP is a fine-grained vision and text alignment model that achieves global and region-level image-text alignment through two-stage training, enhancing fine-grained visual understanding ability.

Multimodal Alignment

Transformers English

Wspalign Xlm Base

WSPAlign is a weakly supervised large-scale span prediction-based word alignment pre-training model that supports word alignment tasks for multiple language pairs.

Machine Translation

Transformers Supports Multiple Languages

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase